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Defector-accelerated cooperativeness and punishment in public goods games with mutations 
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We study the evolution of cooperation in spatial public goods games with four competing strategies: cooper- 
ators, defectors, punishing cooperators, and punishing defectors. To explore the robustness of the cooperation- 
promoting effect of costly punishment, besides the usual strategy adoption dynamics we also apply strategy 
mutations. As expected, frequent mutations create kind of well-mixed conditions, which support the spreading 
of defectors. However, when the mutation rate is small, the final stationary state does not significantly differ 
from the state of the mutation-free model, independently of the values of the punishment fine and cost. Never- 
theless, the mutation rate affects the relaxation dynamics. Rare mutations can largely accelerate the spreading 
of costly punishment. This is due to the fact that the presence of defectors breaks the balance of power between 
both cooperative strategies, which leads to a different kind of dynamics. 
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Evolutionary game theory formalizes the dynamics of pop- 
ulations of interacting individuals, considering the success 
(payoff) of their interactions. While this approach has applica- 
tions in biology 1 1-5], economics |6], and the social sciences 
(TD, it has attracted a great deal of interest among physicists 
as well due to the relevance of methods from non-linear dy- 
namics [Sy, statistical physics [,90, cellular automata 1,10,1 . and 
many -particle simulations lUlll . 

One of the grand scientific challenges in this field con- 
cerns the question, how the outcome of interactions in so- 
cial dilemma situations can be improved. In social dilemmas 
such as the public goods game, the collective well-being de- 
pends on the cooperation of individuals, which however is un- 
likely, as selfish behavior can generate higher personal profits. 
It has been proposed that reputation and costly punishment 
can fight free-riding (defection) and promote cooperation in 
public goods situations [12]. It is puzzling, however, why 
people would make punishment efforts, as this reduces their 
payoffs compared to others who do not punish ("second-order 
free-riders"). In fact, punishing strategies disappear in public 
goods games, when the interactions between individuals are 
well mixed, creating again a tragedy of the commons. How- 
ever, when individuals have spatial neighborhood interactions, 
free-riders may be eliminated (both, conventional and second- 
order ones) lll3L[l4ll . This is due to the fact that the different 
cooperative strategies form clusters and segregate from each 
other In this way, punishing cooperators avoid to be exploited 
by second-order free-riders (non-punishing cooperators) and 
can efficiently fight against defectors. Adding strategy muta- 
tions, however, endangers homogeneous clusters of individu- 
als pursuing the same strategy, as they support the intrusion of 
competing strategies ("enemies"). For example, if defectors 
manage to enter a cooperative cluster, it can quickly erode. 
As a consequence, one would expect that mutations under- 
mine the spreading of punishing strategies, thereby restoring 
the "second-order free-rider problem" and the "tragedy of the 
commons". 

Therefore, this paper investigates the impact of strategy mu- 



tations on the evolution of cooperation in the spatial public 
goods game with punishing strategies. As interaction graph, 
we assume a square lattice. Punishment is introduced by 
means of two additional strategies besides cooperators (C) and 
defectors (D). These two strategies are punishing cooperators 
(PC) and punishing defectors (PD), both of which impose a 
fine on defectors at a personal cost. The public goods game 
is iteratively played on a fully occupied square lattice of size 
L X L with periodic boundary conditions, where each player 
X holds a strategy s^ G {C, D, PC, PD}. Initially, the four 
strategies are equally and uniformly distributed over the L^ 
lattice sites. Each player x is a member of G = 5 groups 
consisting of 5 individuals each. Each of these groups corre- 
sponds to a Neumann neighborhood of the focal individual or 
one of the direct neighbors. 

In each iteration, an individual x plays a public goods game 
in all groups it belongs to. Cooperative individuals (playing C 
or PC) make a contribution of 1, while non-cooperative indi- 
viduals (D or PD) contribute nothing. Afterwards, the sum of 
all contributions in each group is multiplied with the "synergy 
factor" r, and the resulting amount is equally shared between 
all of its members, irrespective of their contribution. 

Let P* denote the sum of the shares that individual x re- 
ceives in all of the G groups it participates in. Then P* cor- 
responds to the overall payoff of individual x in the absence 
of punishment. This payoff is modified by punishment fines 
and punishment costs as follows: If s^; = D or PD, player 
X is punished with a fine / in such a way that the remaining 
payoff is, Px = Px ^ Yl f'^p- Herein, the sum runs over all 
the groups containing player x. iTp is given by the number of 
punishers (PC or PD) in each group (not considering player 
x), divided by G — 1. Moreover, if Sx = PC or PD, player 
X invests a punishment cost c such that the finally remaining 
payoff is Px — Px ~Y1 cTTd- Herein, the sum runs again over 
all the groups containing player x. iTd is given by the number 
of defecting individuals around player x in each group (D or 
PD), divided by G — 1. The division by G — 1 serves to scale 
for the group size G. 



The strategies are updated according to the following 
Monte Carlo procedure: In each elementary step, a player 
X and one of its neighbors y is randomly chosen. For 
both individuals, the payoffs Px and Py are determined 
as described above. It is assumed that individual y imi- 
tates the strategy Sx of individual x with probability W — 
{1 + cxp[{Py — Px) / K]}~ , where K denotes the uncer- 
tainty of strategy adoptions lUSll . Here, we use the value 
K = 0.5. During one full iteration (Monte Carlo step MCS), 
the strategy of each player may be copied once on average. 

Following the work of Traulsen et. al. 1 16], mutation is in- 
troduced as a separate process. Accordingly, a player changes 
his or her strategy randomly (independently of the neighbor- 
hood) with a probability /i, while the above described strategy 
adoption process is executed with probability 1 — /i. In other 
words, in the limit /i ^> 1, the game-specific strategy adoption 
is completely ignored. 

Initially, each player follows a strategy at random. For 
all combinations of cost and fine parameters, the simulations 
were performed for systems with L > 400. Values greater 
than 400 (up to 1600) were chosen in the vicinity of the phase 
boundaries. This served to avoid that small strategy clusters 
would disappear by accident (by chance). The fractions ps 
of individuals using the strategies s were determined after the 
transient time (up to 10^ iterations, depending on the speed of 
convergence). 

In the absence of punishment and mutation, cooperators die 
out at r = 3.74, as can be concluded from Fig. 2 of Ref. 
lUTIl . For lower synergy factors, defectors dominate, while for 
higher values of r, cooperators can survive, or even spread all 
over the system (if r > 5.45). Taking these values as a refer- 
ence. Fig. [U shows a representative phase diagram of the spa- 
tial public goods game with punishment. As the punishment 
fine / is increased, it can be observed that (for intermediate 
values of the punishment cost c) the system goes from a pure 
D phase over a mixed Dh-PC phase to a pure PC phase. If 
the cost of punishment is high (c > 0.51), the mixed Dh-PC 
phase disappears completely, and the system directly changes 
from a pure D to a pure PC phase via a discontinuous phase 
transition. In the other extreme, if the cost of punishment is 
low (c < 0.013), we have an additional area characterized 
by a coexistence of PC and PD (see inset of Fig. [U. Quite 
surprisingly, the second-order free-rider strategy C is not sus- 
tainable for r — 3.5. Only if r is increased, the pure D phase 
becomes a mixed Dh-C phase, which is the only phase where 
non-punishing cooperators can survive 1 13]. For lower values 
of r, the mixed Dh-PC phase vanishes altogether, thus leaving 
the pure D and the pure PC phases as the only sustainable so- 
lutions, with a discontinuous transition between both phases 
when a critical c(/) line is crossed ] 13]. 

The problem of second-order free riders results from the 
fact that pure cooperators bear no punishment cost, while re- 
ceiving the same share of the public good as punishing coop- 
erators (given the spatial strategy configuration is the same). 
This is, why non-punishing cooperators ("second-order free- 
riders") crowd out punishing one under well-mixed condi- 
tions. However, the resulting tragedy of the commons is natu- 
rally resolved in structured populations 11131 [l4ll . There, the 
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FIG. 1: (color online) Phase diagram of the spatial public goods 
game with cooperators (C), defectors (D), and two punishing strate- 
gies, PC and PD, in the absence of mutations (/i = 0) for a synergy 
factor of r = 3.5 (after I13ll ). The inset magnifies the part of the 
phase diagram for small cost values, where the two punishing strate- 
gies PC and PD can coexist. Dashed lines indicate a first-order, solid 
lines a continuous phase transition. 



victory of the punishing cooperators is not based on a di- 
rect competition between the C and PC strategies, but rather 
on their different success in encounters with defectors. Due 
to the fixed, finite neighborhood, both the PC and C strate- 
gies form homogeneous clusters on the spatial grid and are 
exploited by defectors. If the fine is sufficiently large, pun- 
ishing cooperators can overcome defectors, while cooperators 
can not. (Remember that r > 3.74 is needed for cooperators 
to be sustainable in the presence of defectors.) Thus, punish- 
ing cooperators can spread when competing with defectors, 
while non-punishing cooperators are crowded out by them. 
As a consequence, second-order free-riders (cooperators) dis- 
appear, while punishing cooperators take over. 

In the following, we investigate how robust this mechanism 
based on the clustering and segregation is with respect to strat- 
egy mutations. We proceed similarly as in Ref. ]4ff], but for a 
spatial setting and considering punishing defectors rather than 
loners. For each phase displayed in Fig.[T] we find the follow- 
ing typical behavior: Small mutation rates do not significantly 
change the strategy distribution as compared to the mutation- 
less case. However, for /i « 10^^ or higher, the fraction of 
defectors increases quickly to values close to 1 , as mutations 
generate kind of well-mixed conditions, then. Finally, in the 
limit /i w 1, mutations dominate the dynamics, leading to 
an equidistribution of strategies (i.e. the fraction of defectors 
drops again). Figure |2] shows a typical example for the PC 
phase, where the dominance of punishing cooperators is sus- 
tained until approximately /i « 10^''. 

Naturally, the value of the mutation rate, beyond which 
defectors can efficiently spread, is highly dependent on the 
//c-ratio. Increasing the fine / can reduce the impact of mu- 
tations, because this strengthens punishing cooperators com- 
pared to defectors. Nevertheless, sufficiently high value of 
jj, eventually promote the spreading of defectors through the 
creation of a kind of well-mixed state. As emphasized before, 
a successful spatial clustering and segregation of strategies is 
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FIG. 2: (color online) Fractions of all four strategies in dependence 
of the mutation rate /i for c — 0.6 and / — 0.6, which for ^ = 
lies in the PC phase. Filled green circles represent punishing coop- 
erators (PC), open blue circles cooperators (C), filled yellow squares 
punishing defectors (PD), and open red squares defectors (D). 



a precondition for the spreading of cooperative behavior and 
punishment in the public goods game. 

While the introduction of moderate mutations does not sig- 
nificantly affect the final outcome of the competition between 
strategies, this does not necessarily apply to the dynamics, par- 
ticularly when both cooperative strategies (PC and C) become 
equivalent after the extinction of defecting strategies (D and 
PD). When non-punishing compete with punishing coopera- 
tors, a slow logarithmic coarsening (in the absence of surface 
tension) takes place, which is equivalent to the dynamics of 
the voter model. Despite the slow dynamics, the fixation to the 
absorbing PC phase is relatively fast, because, after the extinc- 
tion of defectors, the fraction of punishing cooperators is high 
compared to the fraction of cooperators. This is a direct conse- 
quence of the greater success of punishing cooperators in the 
competition with defectors during the early stages of strategy 
competition, when the punishment cost is large enough. How- 
ever, if the fractions of punishing and non-punishing cooper- 
ators were about the same and no defecting strategies were 
present, it would require exceptionally long to reach any of 
the absorbing states (C-only or PC-only). Such a scenario is 
illustrated in Fig.[3](solid green line), where the initial fraction 
of punishing cooperators is assumed to be 0.6 and the fraction 
of cooperators is assumed to be 0.4. 

Remarkably, even if a tiny mutation rate is introduced, 
which occasionally creates defectors and punishing defectors, 
this generates an enormous advantage of punishing coopera- 
tors over cooperators in the battle with defecting strategies. 
The presence of defecting strategies destroys the equivalence 
of cooperators and punishing cooperators and breaks the bal- 
ance of power in favor of punishing cooperators. This results 
in a striking acceleration of the spreading of punishing coop- 
erators, as depicted by the dashed green line in Fig. [3] It is 
notable that a mutation rate as tiny as /i = 0.000001 evokes 
such an enormous change in the system dynamics. 

The mutation-induced acceleration of the coarsening pro- 
cess relies on the same effect that creates the dominance of 
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FIG. 3: (color online) Time evolution of the fraction ppc of punish- 
ing cooperators for a small mutation rate of fi — 10~® (dashed green 
line in panel (a)), and in the absence of defecting strategies and muta- 
tions (solid green line). In both cases, the initial state was assumed to 
consist of 60% punishing cooperators and 40% cooperators. Panel 
(b) shows the time evolution of the fraction pD of defectors in the 
presence of small strategy mutations (p = 10~®). Curves are aver- 
ages over 100 independent runs for a grid of size 1600 x 1600. 



punishing cooperators over cooperators. It gives rise to a 
Dh-PC phase and, solves the second-order free-rider problem 
due to the disappearance of non-punishing cooperators. When 
defectors occur in the vicinity of cooperators, they can spread 
efficiently because of the low value of r. Defectors, however, 
cannot succeed against punishing cooperators, if the fine is 
sufficiently high. Consequently, punishing cooperators spread 
at the cost of defectors, while these crowd out cooperators. As 
a consequence, a quick victory of punishing cooperators over 
cooperators requires an interaction between three strategies: 
C, PC and D. The snapshots of Fig. |4] demonstrate the coars- 
ening process impressively for the cases with mutation (bot- 
tom row) and without (top row). Starting with identical spatial 
distributions of cooperators and punishing cooperators, it can 
be observed that the fractions of the two strategies remain al- 
most the same, when no mutations take place. However, in 
the presence of a small rate of strategy mutations (/i = 10~^), 
defectors can temporarily spread in the population at the ex- 
pense of cooperators. This, in turn, provides conditions for 
a fast spreading of punishing cooperators at the expense of 
defectors. This dynamics replaces the slow logarithmic coars- 
ening in the absence of defectors. Notably, once punishing 
cooperators take over the majority of the spatial grid, a sig- 
nificant fraction of defectors can no longer exist. This can be 
seen on the bottom of Fig. |4]as well as in panel (b) of Fig. [3] 
which demonstrates the temporary uprise of defectors just be- 
fore punishing cooperators prevail in the system. 

In summary, we have studied the evolution of coopera- 
tion in public goods games with mutation and punishment, 
where punishing cooperators and punishing defectors were 




FIG. 4: (color online) Typical snapshots of the simulation grid after 
0, 900, 1,200, and 10,000 iterations during the coarsening process 
with strategy mutations (bottom) and without (top). The initial con- 
figurations are identical, and the parameter values agree with those 
of Fig.H] All panels show a 100 x 100 part of the 1600 x 1600 grid. 
Red (grey) sites correspond to defectors, green (light) ones to pun- 
ishing cooperators, and blue (dark) ones to cooperators. Punishing 
defectors cannot survive. It can be clearly seen that the presence of 
defectors due to strategy mutations largely accelerates the spreading 
of punishing cooperators. 



taken into account besides conventional cooperators and de- 
fectors. Considering structured populations naturally solves 
the second-order free-rider problem by spatially separating 
the interaction of cooperators with defectors and of defec- 
tors with punishing cooperators. Since punishing cooperators 
are able to outperform defectors at sufficiently large punish- 
ment fines and defectors are superior to cooperators, punish- 
ing cooperators are the winners of the strategy competition in 
space. This mechanism is robust to modest mutation rates. 



while large mutation rates create kind of well-mixed interac- 
tions, which promote a spreading of defectors and, thereby, a 
tragedy of the commons. Naturally, in the limit /i — !■ 1 mu- 
tations become so strong that they create a game-independent 
random strategy distribution. 

Despite the robustness of the final outcome to moderate 
mutation rates, we could demonstrate that even tiny mutation 
rates can have an enormous impact on the evolutionary dy- 
namics, particularly when punishing cooperators would other- 
wise compete with cooperators only. When no other strategies 
are present, punishing and non-punishing cooperators receive 
the same payoffs, which leads to a slow logarithmic coarsen- 
ing as in the voter model. The occurrence of defectors through 
strategy mutations breaks the balance of power between the 
two cooperative strategies. This can dramatically accelerate 
the spreading of punishing cooperators. Note that, in many 
systems, mutations lead to a different outcome. In the model 
studied here, however, mutations have an effect like a catalyst: 
they speed up a process while the outcome of the system is not 
affected. 
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